Lesson: Using Data Cleanse Transforms 


Italian, Spanish, or Portuguese. These Data Cleanse transforms include all required options
except input fields.


Data Cleanse Transform Configuration 


Configure options on the Data Cleanse transform to fit your operational data project. You 
need to set several options: 


• Import the ATL file transferred from Cleansing Package Builder. Importing the ATL file
brings the required information and automatically sets the following options:


Cleansing Package
Engine
Filter Output Fields
Input Word Breaker
Parser Configuration


• In the input schema, select the input fields that you want to map and drag them to the
appropriate fields in the Input tab.


Name and firm data can be mapped either to discrete fields or multiline fields. 
Custom data must be mapped to multiline fields. 


Phone, date, e-mail, Social Security number, and user-defined pattern data can be 
mapped either to discrete fields or multiline fields. The corresponding parser must be 
enabled. 


• In the Options tab, select the appropriate option values to determine how Data Cleanse
processes your data.


The Cleansing Package option 
The Parser_Sequence_Multiline options 


• In the Output tab, choose the fields that you want to output from the transform. In
Cleansing Package Builder, output fields are referred to as attributes. Make sure that you
map any attributes (output fields) taken from user-defined patterns in Cleansing Package
Builder reference data.
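
The input mapping rules in the steps above can be sketched as a small pre-check that you might run on a planned mapping before building it in the Designer. This is an illustrative sketch only, not part of Data Services: the names Mapping, check_mappings, ALLOWED_KINDS, and PARSER_REQUIRED, and the data type labels such as "ssn" and "user_defined_pattern", are assumptions made for this example.

```python
from dataclasses import dataclass

# Hypothetical labels for this sketch; they are not Data Services identifiers.
DISCRETE = "discrete"
MULTILINE = "multiline"

# Field kinds that each data type may be mapped to, following the rules above.
ALLOWED_KINDS = {
    "name": {DISCRETE, MULTILINE},
    "firm": {DISCRETE, MULTILINE},
    "custom": {MULTILINE},  # custom data must go to multiline fields
    "phone": {DISCRETE, MULTILINE},
    "date": {DISCRETE, MULTILINE},
    "email": {DISCRETE, MULTILINE},
    "ssn": {DISCRETE, MULTILINE},
    "user_defined_pattern": {DISCRETE, MULTILINE},
}

# Data types whose corresponding parser must be enabled.
PARSER_REQUIRED = {"phone", "date", "email", "ssn", "user_defined_pattern"}


@dataclass(frozen=True)
class Mapping:
    source_column: str  # column in the input schema
    data_type: str      # one of the keys in ALLOWED_KINDS
    field_kind: str     # DISCRETE or MULTILINE


def check_mappings(mappings, enabled_parsers):
    """Return a list of human-readable problems; an empty list means no rule is broken."""
    problems = []
    for m in mappings:
        allowed = ALLOWED_KINDS.get(m.data_type)
        if allowed is None:
            problems.append(f"{m.source_column}: unknown data type '{m.data_type}'")
            continue
        if m.field_kind not in allowed:
            problems.append(
                f"{m.source_column}: {m.data_type} data cannot be mapped "
                f"to a {m.field_kind} field"
            )
        if m.data_type in PARSER_REQUIRED and m.data_type not in enabled_parsers:
            problems.append(f"{m.source_column}: the {m.data_type} parser is not enabled")
    return problems
```

For example, calling check_mappings with a custom column mapped to a discrete field, and a phone column while only the date parser is enabled, reports one problem for each of those two mappings.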


Parsing Results Improvement 

Starting with Data Services 4.0, the Cleansing Package Builder module of Information Steward
is required to modify or customize any type of data. The Dictionary menu has been removed
from the Data Services Designer menu bar, and Data Cleanse no longer requires a separate
cleansing package repository.